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OPTIMAL TESTING OF EQUIVALENCE HYPOTHESES 

By Joseph P. Romano 

Stanford University 

In this paper we consider the construction of optimal tests of 
equivalence hypotheses. Specifically, assume Xi, . . . ,X„ are i.i.d. with 
distribution Pg, with G R''. Let g{d) be some real- valued parame- 
ter of interest. The null hypothesis asserts g{9) ^ {a,b) versus the 
alternative g{9) £ {a,b). For example, such hypotheses occur in bioe- 
quivalence studies where one may wish to show two drugs, a brand 
name and a proposed generic version, have the same therapeutic ef- 
fect. Little optimal theory is available for such testing problems, and 
it is the purpose of this paper to provide an asymptotic optimality 
theory. Thus, we provide asymptotic upper bounds for what is achiev- 
able, as well as asymptotically uniformly most powerful test construc- 
tions that attain the bounds. The asymptotic theory is based on Le 
Cam's notion of asymptotically normal experiments. In order to ap- 
proximate a general problem by a limiting normal problem, a UMP 
equivalence test is obtained for testing the mean of a multivariate 
normal mean. 

1. Introduction. Suppose Xi, . . . , Xn are i.i.d. with distribution Pg, where 
9 is a vector in M'^. Let g{9) be some real- valued parameter of interest. The 
hypothesis testing problem we study in this paper is of the following form: 
the null hypothesis asserts g{9) ^ {a,b) versus the alternative g{9) € {a,b). 

This setup arises when trying to demonstrate equivalence (or sometimes 
called bioequivalence) of treatments. By comparing a pharmacokinetic pa- 
rameter of a new drug to the standard drug, bioequivalence is declared if the 
parameter g{9) lies in the interval (a, 6), where (a, b) is specified by a regula- 
tory agency. For example, if g{9) is the difference in treatment means, then 
equivalence corresponds to values of g{9) near zero, and so {a,b) = (—A, A) 
for some A > 0. Then, rejection of the null hypothesis is the same as declar- 
ing equivalence. By formulating the null hypothesis as g{9) ^ (—A, A), the 
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risk of marketing an alternative drug that does not behave hke the stan- 
dard drug is controhed. In some situations it may be more appropriate to 
specify equivalence by a ratio of means, and equivalence is then declared 
if the ratio is near one, so that (a, b) would be an interval containing one. 
More generally, the problem may consist of determining equivalence across 
several parameters, but only the simple real-valued case is treated here. A 
very nice recent account of testing hypotheses of equivalence is given in [17]. 
For the remainder of the paper, we assume without loss of generality that 



If P belongs to a one-parameter exponential family, then a (uniformly 
most powerful) UMP level a test exists; see [10], Theorem 6 in Chapter 3. 
More generally, if the family of distributions is strictly positive of order 3 
and other mild continuity conditions are satisfied, then a UMP test exists; 
the more general result appears as Problem 30 in Chapter 3 of [10] and is 
proved in [7]. 

Example 1.1 {Normal location model). First, suppose X is N{fi,a'^), 
the normal distribution with mean /x and variance cr^. Assume cr^ is known. 
The problem is to test > A versus < A. Applying the previously men- 
tioned result, the UMP level a test rejects if \X\ < C, where C = C{a,A,a) 
satisfies 



and $(•) is the standard normal c.d.f. 

Next, suppose Xi, . . . , Xn are i.i.d. N{fi, cj^). For testing the same hypoth- 
esis, the UMP level a test rejects if n^^'^\Xn\ < C(a, n^/^A, a). 

If a is unknown, no UMP test exists, nor do unbiasedness or invar iance 
considerations lead to an optimal test. 

Outside a small class of models, no optimality theory is available for tests 
of equivalence. Wellek [17] provides a general construction of asymptotically 
valid tests, based on some asymptotically normal estimators, but no theory is 
provided to prove optimality of such procedures. The main goal of this paper 
is to provide an asymptotic optimality theory for such problems. Specifically, 
we obtain bounds for the asymptotic power of tests of equivalence for a 
large class of models, as well as construct efficient tests that attain these 
bounds. As will be seen, the results flow from Le Cam's approach based on 
convergence of experiments; see [8]. In order for this approach to be viable, 
we need to determine an optimal test for the limiting normal experiment; this 
is accomplished in Section 2, where an exact finite sample theory uniformly 
most powerful test is derived for testing the equivalence of a linear function 
of a multivariate normal mean. 



(a,6) = (-A,A). 
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In Section 3 we consider asymptotic efficiency. We will formulate the 
asymptotic problem in two distinct ways. First, consider the case when 
the null hypothesis parameter space is the complement of a fixed interval 
(—A, A) is considered. Then, we analyze the case when this interval changes 
(and shrinks) with n. In each case, attainable upper bounds for the asymp- 
totic power of tests are obtained. The upper bounds in the two approaches 
actually differ, and we prefer the second approach. 

In fact, Janssen [6] has already considered the problem of testing equiva- 
lence in a semiparametric two-sample framework, which in many ways is a 
more difficult problem. He too considered a shrinking alternative parameter 
space. Building on the work of Pfanzagl [12, 13] and Janssen [5], his tech- 
nique also relies on a reduction to an asymptotically normal experiment. 
However, he proves optimality of the power function at a particular value 
of the functional of interest, which, in our case, corresponds to the value 
of g{9) being zero. He also imposes an asymptotic similarity condition (see 
(36) of [6]). Here, we obtain asymptotically optimal tests, uniformly over 
values of the parameters in the region for which equivalence is declared, and 
no asymptotic unbiasedness condition is imposed. Of course, we are working 
in a parametric framework in order to obtain such a clean result. But, our 
results can be used in semiparametric models by an appropriate reduction 
to a parametric least favorable submodel. Some asymptotic nonparametric 
results which differ slightly from those of Janssen [6] can be obtained from 
the author. 

Even in the normal one-sample problem with unknown variance, the prob- 
lem of testing for equivalence has a rich history, and some of the literature 
is given in Example 3.1. Our asymptotic results will apply quite generally 
to parametric models, under the weak assumption of quadratic mean differ- 
entiability. The results generalize immediately to two-sample (or s-sample) 
problems, as well as to more complicated designs (such as a crossover design), 
as long as the underlying model is smooth enough to permit convergence to a 
normal experiment, since the optimal test in the limiting normal experiment 
is completely specified in Theorem 2.1. 

2. A finite sample UMP test. Throughout, $(•) is the standard normal 
distribution function and Za satisfies ^{za) = a. Before discussing optimality 
for general models, we first need to derive the optimal test in the appropriate 
limiting normal experiment. This is obtained in the following result. 

Theorem 2.1. Suppose (Xi,...,Xfc) is multivariate normal N{fi,T,) 
with unknown mean = {fii, . . . , /Xfc)"^ cLnd known covariance matrix S ( possibly 
nonsingular) . Fix 6 > and any vector a = (ai, . . . , Ok)'^ satisfying a^Ha > 0. 



4 



J. P. ROMANO 



Consider testing 



H : ^Oj/Uj >6 vs. K : ^aj/.i, 

i=l i=l 

Then a UMP level a test exists and it rejects H if 



<6. 



k 

i=l 



where C = C{a,6,a) satisfies 

C-6 



(2) 



-C-6 



a 



■ a 



and = a^Yla. Hence, the power of this test against an alternative {fii, . . . , ^k) 
with I J2i cLilJ-il = 6' < 6 is 



C-S' 



-C-5' 



a / \ cr 

Proof. The proof will consider four cases in increasing generality. 

Case 1. Suppose = 1, so that Xi = X is N{^,a'^) and we are testing 
> 5 versus |//| < 6. Fix an alternative /i = m with \m\ < 6. Reduce the 
composite null hypothesis to a simple one via a least favorable distribution 
that places mass p on N{5,a'^) and mass 1 — p on N{—6,(t'^). The value of 
p will be chosen shortly so that such a distribution is least favorable (and 
will be seen to depend on m, a, a and 6). By the Neyman-Pearson lemma, 
the MP test of 

pN{6, a^) + (1 - p)N{-6, a^) vs. iV(m, a^) 

rejects for small values of 

pexp[-(l/(2a^))(X - 6f] + (1 - p) exp[-(l/(2^^))(X + 6)'] 
^ ' exp[-(l/(2cj2))(X-m)2] 

or, equivalently, for small values of f{X), where 

/(x) = pexp[(5 - m)X/a'^] + {l-p) exp[-{6 + m)X/a'^]. 

We can now choose p so that /(C) = f{—C), so that p must satisfy 

p exp[{5 + m)C/a'^] — exp[—{5 + m)C/a'^] 
^' 1 - p " exp[((5 - m)C/a^] - exp[-((5 - m)C/a^] ' 

Since 6 — m > and 6 + m > 0, both the numerator and denominator of 
the right-hand side of (4) are positive, so the right-hand side is a positive 
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number; but, p/{l—p) is a nondecreasing function of p with range [0,oo) as 
p varies from to 1. Thus, p is well defined. Also, observe f"{x) > for all 
X. It follows that (for this special choice of C) 

{X:fiX)<fiC)} = {X:\X\<C} 

is the rejection region of the MP test. Such a test is easily seen to be level a 
for the original composite null hypothesis because its power function is sym- 
metric and decreases away from zero. Thus, the result follows by Theorem 
6 in Section 3.7 of [10]. 

Case 2. Consider now general k, so that {Xi, . . . , X^) has mean (/ii, . . . , fi^) 
and covariance matrix S. However, consider the special case (ai,...,Ofc) = 
(1,0, ... ,0), so we are testing |;Ui| >5 versus < 6. Also, assume Xi and 
{X2, . . . , Xk) are independent, so that the first row and first column of S are 
zero except the first entry, which is o"^ (assumed positive). Using the same 
reasoning as in Case 1, fix an alternative m= (mi,...,mfc) with \mi\ < 5 
and consider testing 

pN{{6, m2,...,mfc),S) + (1 - p)iV((-(5, 7712, . . . , m^), S) 

versus A^((mi, . . . , m^), S). The likelihood ratio is, in fact, the same as (3) 
because each term is now multiplied by the density of {X2, ■ ■ ■ , X;^) (by 
independence), and these densities cancel. The UMP test from Case 1, which 
rejects when \Xi \ < C, is UMP in this situation as well. 

Case 3. As in Case 2, consider ai = 1 and aj = if i > 1, but now allow 
S to be arbitrary. Reduce the problem to Case 2 by an appropriate linear 
transformation. Simply let Yi = Xi and, for i > 1, let 

Var(Xi) 

so that Coy{Yi,Yi) = Oifi> 0. Thus, the problem of testing E{Yi) = E{Xi), 
based on Y = (Yi, . . . ,1^), is in the form studied in Case 2, and the UMP 
test rejects for small |Yi| = l-'^il- 

Case 4. Now, consider arbitrary (ai,...,afc) satisfying a-^Sa > 0. Let 
Z = OX, where O is any orthogonal matrix with first row (ai,...,afc). 
Then E{Zi) = J2i=i(^if^ii and the problem of testing |ii^(Zi)| >6 versus 
|-E(Zi)| < 5 reduces to Case 3. Hence, the UMP test rejects for small values 
of |^i| = |Eiia*^i|- □ 

Next, we summarize some simple but useful properties of the critical con- 
stants C{a,6,a) and the optimal power of the above UMP test that will be 
used later. 
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Remark 2.1. It is easy to check that, as a function of C, the function 
h{C) given by 

is increasing in C. Since 

h{6 — azi-a) = a — (t{—2a~^5 + zi-a) < a, 

it follows that 

(5) C{a,6,a) > 6 — azi-a- 
Remark 2.2. The function C{a,6,a) satisfies 

(6) ^^^ = c(a,^,iy 
It is also easy to check that 

C{a,e,l) Z(i_a/2) 
as e — > and C{a, B ,1) ^ oo as B ^ oo. 

Remark 2.3. For fixed C = C{a, 6, a), the function 

/(7)=$(C-7)-^(-C-7) 

is decreasing in 7; to see why, just differentiate /. So if < 7 < 5, then 
/(7)>a. 

3. Asymptotic optimality. Throughout this section, we assume Xi , . . . , X„ 
are i.i.d. according to a distribution Pg, with G 0, where ft is an open sub- 
set of M'^. Here the observations take values in a sample space X. Assume 
Pg has density pg with respect to /i. We will assume the family is quadratic 
mean differentiable (q.m.d.) at certain values of 6; that is, there exists a 
vector of real-valued functions 7]{-,9o) = {r]i{-,6o), . . . ,%(-,0o))^ such that 



(7) / [Vp0o+h{x) - Vpeoi^:) - {r]{x,eo),h)] dfi{x) = o{\h\^) 

JX 

as \h\ ^0. Here /i is a vector in R*^ and \h\ denotes its Euclidean norm. For 
such a family the Fisher information matrix at is the matrix I{9o) with 
entry 



Iij{9o)=^ / r]i{x,9o)r]j{x,0o)dfi{x). 
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We also define the score vector Z„ to be 

n 

(8) Zn = ZM = 2n-^/^Y.^r^{XM/pl[\Xi)]. 

1=1 

For a review of families that are q.m.d., as well as the history and importance 
of this notion, see [9]. In particular, we make heavy use of the fact that such 
families are locally asymptotically normal, and so the testing problem under 
consideration can be approximated by a certain normal testing problem. 

Interest focuses on g{0), where g is a function from O to M. Assume 
g is differentiable with gradient vector g(9) of dimension 1 x fc. We will 
formulate the problem in two distinct ways. First, we consider the case 
when the null hypothesis parameter space is the complement of a fixed 
interval (—A, A). Then, we study the case when this interval changes (and 
shrinks) with n. 

3.1. Fixed parameter spaces. Fix A > 0. The problem is to test \g{0)\ > 
A versus \g{0)\ < A. We implicitly assume g is such that there exists a 6 
such that g{9) > A, as well as a with g{9) < —A. For any fixed alternative 
value 6 with \g{0)\ < A, the power of any reasonable test against 6 will 
tend to one. Therefore, as is customary (see [16], Chapters 14 and 15), 
we compare power functions at local alternatives. Consider any fixed 
satisfying 15(6*0)1 = A. For sake of argument, consider the case g(6'o) = — A. 
In order to derive an (obtainable) upper bound for the limiting power of a 
test sequence (pn under 6o + hn~^^'^ , a crude way to bound the power is based 
on the simple fact that any level a test for testing \g{0)\ > A versus \g{0)\ < 
A is also level a for testing g(9) < —A versus g{6) > — A. Since upper bounds 
for the (asymptotic) power are well known for the latter testing problem 
(as in [16], Theorem 15.4), an immediate result follows. In this asymptotic 
setup, the statistical problem is somewhat degenerate, as it becomes one of 
testing a one-sided hypothesis. For example, suppose Xi,...,Xn are i.i.d. 
N(9,l). Then for large n, one can distinguish 6 < —A and 9 > —A with 
error probabilities that are uniformly small and tend to zero exponentially 
fast with n. In essence, the statistical issue arises only if the true 9 is near 
the boundary of [—A, A], in which case determining significance essentially 
becomes one of testing a one-sided hypothesis. 

Example 3.1 {Normal one-sample problem). Suppose Xi, . . . ,Xn are 
i.i.d. N{^,a^), with both parameters unknown. Consider testing |/i| > A 
versus < A. The standard t-test for testing the one-sided hypothesis ^ < 
—A against fi> — A rejects if 

n^/^{Xn + A)/Sn>tn-l,l-a, 



8 



J. P. ROMANO 



where 5^ is the (unbiased) sample variance and tn-i^i-a is the I — a quantile 
of the ^-distribution with n — 1 degrees of freedom. Similarly, the standard 
t-test of the hypothesis /i > A rejects if 

ni/2(x„- A)/5„<-t„_i,i_„. 

The intersection of these rejection regions is therefore a level a test of the 
null hypothesis > A. Such a construction that intersects the rejection 
regions of two one-sided tests (TOST) was proposed in [18] and [15], and 
can be seen as a special case of Berger's [2] intersection-union tests; see [3] 
for a review. The resulting TOST is given by the test i^^^^'^ that rejects 
when \Xn\ < A — n~^^'^Sntn~i,i~a- The asymptotic power of cpn^^"^ against 
a sequence with mean —A + hn~^^^ {h > 0) and variance fixed at can be 
calculated directly as 

^A+hn-i/2,a{l^n| < ^ - n-^/^Sntn~l,l~a} = $ - , 

which is the optimal bound for the one-sided testing problem given in The- 
orem 15.4 of [16]. A similar calculation applies to sequences of the form 
A — /in~^/^. Thus, the TOST is asymptotically optimal in this setup. It 
should be remarked that the TOST has been criticized because it is biased 
(in finite samples) and tests have been proposed that have greater power; 
some proposals are reviewed and studied in [3], [4] and [11]. These points 
are valid, but no test can have greater asymptotic power against such local 
alternatives. On the other hand, the TOST will be seen to be inefficient 
under the asymptotic setup of the next section. 

3.2. Shrinking alternative parameter space. We now consider a second 
asymptotic formulation of the problem. The null hypothesis asserts \g{d)\ > 
alternative hypothesis asserts \g{0)\ < 5n~^/'^ . Notice now 
that, in this asymptotic study, the parameter spaces (or hypotheses) are 
changing with n. Of course, a given hypothesis testing situation deals with 
a particular n, and there is flexibility in how the problem is embedded into 
a sequence of similar problems to get a useful approximation. Indeed, if 
equivalence corresponds to \g{9)\ < A, we can always set up the problem 
by choosing 6 = An^/^. From an asymptotic point of view, it makes sense 
to allow the null hypothesis parameter space to change with n, or else the 
problem becomes degenerate in the sense that the values of A and —A for 
g{0) can be perfectly distinguished asymptotically. In testing for bioequiva- 
lence, for example, A represents a small value so that a value of \g{6)\ < A is 
deemed sufficiently close to zero in a clinical sense. In a particular situation 
(such as the previous example with a not too small), a value for \g{9)\ of A 
cannot be perfectly tested against a value of g{9) = 0. Thus, if is in some 
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sense not far from both A and —A, it follows that A and —A are not far 
from each other either, and the asymptotic setup should reflect this. 

We implicitly assume there exists some with g[6) > 0, as well as some 
6 with g{6) < 0. The main result of this section is the following theorem. 

Theorem 3.1. Suppose Xi,...,Xn are i.i.d. according to Pq, 9 Gil., 
where Q is assumed to be an open subset ofM.^. Consider testing the null 
hypothesis 

9eno,n = {0-\g{0)\>Sn'^/^} 

versus \g{0)\ < 6n~^^'^ , where the function g from M'^ to M is assumed differ- 
entiable with gradient g{9). Assume for every 9 with g{9) = that the family 
{P9,9 G i}} is q.m.d. at 6 and I[9) is nonsingular. 

(i) Let (j)n = (j)n{Xi, . . . ,Xn) be a uniformly asymptotically level a se- 
quence of tests, so that 

lim sup sup E'e (</>„) < a. 

Assume 9q satisfies 5(6*0) = 0. Then, for any h such that \ {g{9Q)'^ ,h) \ = 6' < 
S, 

(9) limsupi^e„+^„-i/2(<^„) <^(^^) - CD 
where (t|^ is given by 

(10) al=g{9o)I~\Oo)g{9of 

and C = C{a,6,aeQ) satisfies (2). 

(ii) Let 9n be any estimator satisfying 

(11) nV2(4 _ e^) = I^\9o)Zn + opn (1), 

(such as an efficient likelihood estimator). Suppose I (9) is continuous in 9 
and g{9) is continuous at 9q. Then the test sequence (pn that rejects when 
ni/2|g(6'„)| <C{a,6,an), where 

al = g{9n)L-\9n)g{9nf, 

is pointwise asymptotically level a and is locally asymptotically UMP in the 
sense that the inequality (9) is an equality. In fact, the same properties hold 
for any test sequence that rejects if |r„| < C(a,5, (T„), if satisfies 

Tn = gi9o)I~Hdo)Zn,0o + opn^il) 
for every 9q £ Qq, where Znfi^ is the score vector defined in (8). 
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Proof. Fix satisfying g{9o) = 0. We will derive an upper bound for 
the limiting power of a test sequence (f>n under 6q + hn"^/"^ . Note that 

So, if h is such that 1(5(^0)^, Ml > then |c/(0o + hn~^/'^)\ > 6n"^/^ for all 
sufficiently large n. Hence, if (pn has limiting size a, then, for such an h, 

(12) limsupSg +^„_i/2((/'n) < a. 

Since the family is q.m.d., the sequence of experiments ^^j_|_/^,„-i/2 (indexed 
by a vector h) converges to a limiting (multivariate) normal experiment 
with unknown mean vector h and known covariance matrix I~^{6q). There- 
fore, we can approximate the power of a test sequence cpn by the power 
of a test (j) = (j){X) for the (limit) experiment based on X from the model 
N{h,I'^{eo)); see Lemma 3.4.4 of [14] or Theorem 15.1 of [16]. So, let P^{h) 
denote the power function of (j){X) when X ~ iV(/i, I~^{6o)). Then (12) im- 
plies (3(j,{h) < a if \{g{6o)'^ ,h)\ > 6. By continuity of I3^{h), I3^{h) < a for 
any h with \{g{9Q)'^ ,h)\ > 5. The choice of (f) to maximize (3^{h) for this 
limiting normal problem was given in Theorem 2.1 with S = I-i((9o) and 
= (/{Oq). Thus, if (p is level a for testing \{g{9Q)'^ ,h) \ > 6 and h satisfies 
\{gidof,h)\=5'<6, then 



and C = C(a, (5, (T6)o) satisfies (2). 

To prove (ii), consider the test that rejects when n^^'^\g{9n)\ < C{a, S, (T„). 
Fix h such that \{g{9o)'^, h) \ = 6' < 6 and let 9n = 9o + hn-^/"^. Then, using 
standard contiguity arguments, under 



But 



Therefore, under 9n, 



n'/'[g{9n)-g{9n)]^{0,al). 
n^l^g{9n) = {Kg{9of) + o{l). 
n'/''g{9n)^Nah,gi9of),al) 



Also, under 9n, o"„ tends in probability to ag^^, and so C(a,5, (T„) tends 
in probability to C(a, 5, aeg). Hence, letting Z denote a standard normal 
variable, 

Pe„K/'|5(^n)| <C(a,(5,<T„)}^P{|aeoZ+(/i,5(^o)^)|<C(a,5,aeJ}, 
which agrees with the right-hand side of (9). □ 
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Example 3.2 (Normal one-sample problem, Example 3.1, continued). 
Suppose Xi,...,Xn are i.i.d. N{jj,,a'^) with both parameters unknown, so 
that 6 = {fJ,,cr). Let g{9) = ^ and consider testing |;u| > (5n~^/^ versus < 
6n~^^'^. By the previous theorem, for any test sequence (pn with Umiting size 
bounded by a and any h with \h\ <6, 

fC-h\ f-C-h\ 

(13) E^^.y.^M < [-^) - ^ [-^) ' 

where C = C{a,6,a) satisfies (2). A test whose hmiting power achieves this 
bound is given by the test c^* that rejects when 

n^/^\Xn\<C{a,6,Sn), 

where is the (unbiased) sample variance (or any consistent estimator 
of o"^). In the normal model, such an approximate test was first proposed 
by Anderson and Hauck [1] (but in a two-sample context); in essence, the 
general construction of Theorem 3.1 can be viewed as an extension of their 
method. 

On the other hand, the test t/)™^"'" given in Example 3.1 is no longer 
asymptotically efficient. This test (with A = 5n~^^'^) rejects when 

1 /2 ~ 

n l^nl <S — Sntn-l,l-a 

and has power against (fJ'jCr) = {hn~^/'^ ,a) given by 

/-..s D ( + Sntn-l,l~a — h ry S — Sntn~l,l~a — h") 
K^V ^hn-i/2,^<! <^n< j, 

where 

Zn = n^/^iXn - hn-^/^)/a ~ 7V(0, 1). 

Also, cr in probability and tn-i,i-a ^i-a- By Slutsky's theorem, 

(14) converges to 

15 P — + zi_a--<^<--2i-a--k 

L cr a a a ) 

where Z~A^(0,1). Observe that this last expression is positive only if 
czi-a < otherwise, the limiting power is zero! On the other hand, the 
limiting optimal power of is always positive (and greater than a when 
\h\ < 6). Even when the limiting power of (j)^'''^'^ is positive, it is always 
strictly less than that of (/>* . Note that the limiting expression (15) for the 
power of (pn^^^^ corresponds exactly to using a TOST test in the limiting 
experiment N{h,a'^), where you are testing \h\ > 6 versus \h\ < 6 with a 
known; such a procedure is conservative and less powerful than the UMP 
test. In general, (5) implies that 

C{a,6,an) > 6-anZi-a, 
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which shows that the test of Theorem 3.1 is always more powerful than 
the asymptotic TOST construction. 

Remark 3.1. Thus far, we have considered testing for two situations, 
first where the null hypothesis specifies \g{9)\ > A and next where \g{9)\ > 
bj'n}-!'^ . Of course, one can also consider the general situation where the null 
hypothesis is specified by \g(Qy\ > 6/Tn, where Tn — > oo. For the purposes of 
this discussion, suppose g{6) = G M. So, suppose we are testing \9\ > 5/Tn 
with T„ — > oo at a rate slower than n^^'^, so that Tn = o{n^^'^). By contiguity 
arguments, the optimal limiting power will be nondegenerate (meaning away 
from a and 1) for alternatives of the form 5/Tn — h/n^^"^ or 5/Tn + h/n^^'^ for 
h > 0, or, more generally, if h/n^^'^ is replaced by any sequence En satisfying 
e X But, if Tn = o{n^^'^), then (5/r„ and —S/Tn can be perfectly distin- 

guished, and so we are essentially in the first asymptotic setup. That is, the 
asymptotically optimal power against an alternative sequence 5/t„ — h/v}/'^ 
is the same as for testing a one-sided hypothesis 6>5/Tn versus 6 <5/Tn- 

On the other hand, suppose — > oo faster than n^/^, so that v}/^/Tn 0. 
Then 5/t„ and —5/Tn are so close that the optimal limiting power against 
any alternative sequence /i„ with |/in| < 5/Tn is a. 

Perhaps the reader can be more easily convinced of these assertions in 
the N{6, 1) model, where explicit expressions for the power of the UMP 
test exist, but the previous arguments apply to more general models. Thus, 
the two asymptotic approaches previously considered in this section are in 
essence the most general. 

Acknowledgment. Special thanks to Erich Lehmann for some helpful 
discussion. 
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